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System and method for audio signal processing 



This invention relates to the field of sound reproduction, and more particularly 
to the field of digital audio signal processing. 

The invention relates to a sound reproduction system comprising a digital 
audio signal inpul^ a digital audio signal processor and a digital audio signal output. 
5 The invention also relates to an audio signal processor for processing an 

incoming audio signal in an audio output signal In particular the invention relates to a digital 
signal processor (DSP) circuit or program. 

The invention also relates to a method for processing a digital audio signal 

10 

A sound reproduction system, such as e.g. a loudspeaker telephone system, 
includes an output transducer, often called a loudspeaker, and an input jSir an audio signal 
The loudspeaker produces sound pressure waves in response to the audio input signal M^iich 
is representative of a desired sound pressure wave. 

15 Intelligibility of the sound as perceived by the listener is very important, 

especially in noisy environments. The simplest way to increase the intelligibiKty is to 
increase the average SPL (Sound pressure level), i.e. turning up the volume. However, sunply 
turning \xp the volume does not always lead to a more intelligible sound Also, too high an 
output may lead to an overloading of a loudspeaker resulting in a furfher decrease of the 

20 intelligibility. 

A number of attempts have been made to increase llie intelligibility of sound 
United States Patent qjplication US 2002/0015503 proposes e.g. to increase 

intelligibility by individually constraining the gain Actors for different fi:eqaency bands. 

The existing systems and methods are, however, either very complicated, 
25 requiring complicated coroputations and thereby complicated circuitry (hard-ware), or, in the 

case of a program (soft-ware) being used, a complex program, or supply only a limited 

advantage. 

Notwithstanding the above-mentioned referraces, there continues to exist a 
need in the art for improved systems and methods providing improved intelligibility. 
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It is an object of the present invention to provide a sound reproduction system 
and meliiod with in^ro ved intelligibility. 

To this end, the sound reproduction system in accordance with the invention is 
characterized in that the systeax comprises a digital signal processor, the digital signal 
processor comprising a high pass QIP) fflter with a pass frequency of between a first 
ftequency and a second frequency, preferably of between 300 Hz and 2 kHz, a compressing 
anqjlifier (22) for compression and an^jlification of a signal, at least anq)lification (gain) 
being performed after HP fiOltoing, and a dipper jfor clipping the HP ffltered, conqiressed and 
amplified signal above a clipping leveL 

The system in accordance with the invention is based on the following 

insights: 

1 . The incoming signal is amplified to increase the loudness, however 

2. increasing the incommg signal could lead to a signal higher than the maximum 
digital signal, m such cuxjumstances the signal would often be digitally clipp ed, leading to a 
distortion of the signal. 

3- Low fi:equenci6S are removed fit>m the signal, this allows the remainder of the 
signal to be amplified with, on average, a higher gain fector. This is done by the HP filter, 
situated before amplification. The lower frequencies are, m so fir as intelligibility is 
concerned, of relatively low importance. The lower frequencies comprise much of the 
amplitude of the signal, so ronoving the lower frequencies strongly reduces the amplitude of 
the signal, creating head room for amplification, Le. a stronger amplification for the 
remainder of the signal A large part of the amplitude of especially speech is conqirised m the 
lower frequencies so attenuating lower frequencies allows for a considerable increase in head 
room (i.e. anq)lification without hitting a clippmg level). 

4- Simply cutting the lower fiequencies and increasing tiie output would increase 
loud sounds as mudh as the more subdued sounds. Although tins leads to an improvement in 
intelligibility, this does not always results in a readily perceivable uicrease in intelligibility, 
especially not when tiie loudness range, i,e. the compressed amplitude range is large. To 
further mcrease intelligibility the device comprises a compressing amplifier for amplification 
and compression of a signal. The amplitude range is then compressed (as well as amplified). 
The clippo:. clips the signal above a clippmg level, the clipping level is situated within tiie 
compressed amplitude range, whereby the average loudnei^ difference is reduced. The more 
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3 19.01^004 
subdued sounds undergo a stronger amplification than the louder sounds Oeading to a 
compression of the signal to within a compressed ampUtude range) which increases 
intelligibiUiy, because the more subdued sound become better distinguishable. This is done 
by the con^iressing amplifier. Any desirable functional finm relating signal amplitude in to 
signal amplitude out may be implemented as compression relationship, e.g. witii a lookup 
table. Hie loudest sounds are dipped by setting tiie clipping level within the compressed 
ampUtiide range, &rther increasiiig intelKgibiliiy. Hbs is done by the clipper. Ths signal is 
clipped at a certain amplitude (clipping level) witiiin tiie compressed anq)litude range of 
which die limits, in particular die upper limit; may be set at initialization time. ITie 

compression action may at least in part be eflEectuated before or after HP filtei^ 
amplification as such (i.e. die bulk of the gain) is performed after HP filtering since it is the 

removal of flie lower frequency components by die HP filtering which aflows for die on 
average higher gain of flie remainder (i.e. the increase in headroom). 

The concept of clipper should not be understood as limiled to a hard cHpping 
(ie. all values above a predetermined vahe C are set equal to C), but also conq)Tise soft 
dippms, which m a subrange up lo die high limit of the compressed amplitoade range, applies 
another predetermined signal amplitude in/ signal amplitude out relationship. 

Plreferably die campressmg amplifier is arranged not to amplify a signal 
havirig a signal strength below a threshold vahe. 

Below a tiireshold vahie (a certain ndmmum amplitude) die signal is probably 
due to noise. Not amplifyii^ such signals improves inteUigibility. since the noise is reduced. 
Furihemiore die difference between silence and speech is better distinguishable, which also 
mcreases intelligibilily. The direshold may be set at initialization time 

Preferably die digital sound processor conqxcises a low pass filter for filtering 
die signal provided by die compressing amplifier and for providmg an ou^ signal, die pass 
frequency of die low pass filter tying in die range 2 kBfc-Fs/Z, where Fs is die san^le 
frequency. „ 

The congiression action, but m particular die clippmg action may introduce 
disturbmg overtones, which reduce the natnrahiess and inteUigibility of die emitted sound. 

Ftohemwre due to die use of die high pass fiHer, die signal already comprises a relatively 
high proportion of high frequency tones. The low pass filter cuts out or at least reduces ti 
overtones produced by die compressing amplifier leading to a more natural sound and 
inoeasing die intelligibility. 
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The high pass filter is preferably a first order or second order filter, i.e. a filler 
with a relatively gradual slope. It is advantageous to remove much of the energy of the low 
fiequency components of the incoming sigoal to provide head room for Ibe amplification. 
However, a filter with a slope tbat is relatively steep (a step filter being the most extreme 
example of such a filter) removes so much of the lower fiequency con^onents that this may 
result in an unnatmal sounding voice. Preferably the device comprises a means for wuibling a 
user to change the orda^or die cut/off fiequency. Using a order high pass filter results 
in good speech intelligibiUty and/or signal loudness, whereas using a 1^ order high pass filter 
willpresMve the more natural sound of the original signaL 

Preferably the system, and/or program comprises a means for activation and/or 
setting of the fiequency dependence of the low pass filter and/or the high pass filter in 
dependence on the average amplification in the leveling an^ilification stage. The average 
amplification is a measure fijr the average gain of the signal and thereby of the loudness level 
of the emitted sound signaL It is advantageous when the cut-off frequency of the high-pass 
filter increases with increasing average loudness level of the emitted signal, and the cut<rfF 
ftequenqr of die low pass fiher deoKsases with increasing loudness level 

At very high amplification levels (as could h^^pen when the device is used in 
loud, noisy envimnments) the leveling action is relatively large, this relatively large leveling 
action inlioduces a relatively large distortion of flie signal, i.e. the unwanted, unnatural 
overtones of the signal comprise much energy. This leads to a harsh sound. The haish^ 

liffi sound sometimes and even often, as the inventors have realized leads to die listener 

keeping the loudspeaker, especially in mobile tBlq)hones, at some distance from the ear. 
Apart fiom the fiict tiiat holding die loudspeaker at some distance firom die ear m itself wi^ 

lead to a considendjle reduction in signal to noise, since die signal wiU be reduced and die 
noise will be increased, flw mere feet diat the sound itself is perceived as harsh in feet means 
a reduction in inleUigibiUty of die message given. For vocal messages die harshness of tiie 
voice often is an integral part of die message which die person speaking wishes to address die 
Ustener, sometimes even more important dian die actual words of die messages. It is dius 
important fiir die inteUigibility of die message, seen in a concept broader dian merely 
whedier or not words are understood, tiiat a dear "natural" voice transfer is achieved. At 
lower amplification levels die "harsh sound" effect is much less audible. Selectively 
activating or setting die cutoff fiequency at a relatively low fiequency of die low pass filter 
at high amplification levels reduces die "harsh sound effects, while at low amplifications 
levels die need for use of die high pass filter is less, and not activating die high pass filter 
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5 19.01^004 
may actually lead to a more natural sound. In short, at high amplification levels and strong 
leveling action a relatively large part of the amplitude at high fiequencies is due to artificial 
artifects due to the leveling action and removing (in toto or partiaUy) the higher frequencies 
ranovBs the arti&cts thus leading to a more natural sound, at relatively low amplification 
levels and relatively moderate levelmg action, considerably more of the signal an^litude at 
high frequencies is of natural origm and much less due to artificial artifects, so not using the 
low pass filter (or setting ihe cut-off frequency at a relatively high frequency is then 
preferable. By selectively activating the higji pass filter dependent on the amplification, an 
improved sound rqpioduction is possible. lastead of activating/deactivating the low pass filter 
(i.e. an on/off choice) the fiHer diacacteristic of the low pass filter may be made amplification 
dependenii fiw mstance having a filter cutofFfrequency of 2 kHz for high amplification (loud 
environmental noise) and Fs/Z, or e.g. 4 ]sHz, for low amplification or no environmental 
noise, or m other words wherein the cut-off frequency decreases as the loudness level 
inoreases. 

Alternatively the device may have a means for manually setting the filter 
jsharacteristics of the low pass filter, e.g. a knob enabling the user to set die filter cut off 

quency to 2, 3 and 4 Isflz depending on the (lack of) intdligibiUty, or personal preference. 
A sunilar argument holds fijr the high pass filter, at low anqilification levels the need for 
usmg Ihe high pass fiOter is less than at high ampUfication levels, so at low amplification 
levels the high pass filter m^ be deactivated or given a low cut-off frequency level, whereas 
at high amplification level (high loudness) die high-pass filter may be activated or ite cut-off 
fiiequency set at a relatively hi^ valu^ or in other words wiiraein Ihe cut-off frequency of 
Ihe high pass filter increases as the loudness level increases. 

In a first embodimoit flie system comprises die high pass filter followed by an 
AGC followed by a limitec/clipper preferably followed by the low pass filter. This 
embodiment is prefoced viAiea in circumstances where signal loudness is of prime mterest A 
limiter scans for peaks in the audio signal and attenuates the audio portion around the peak if 
the attenuation is necessary to limit the amount of clipping, while yet for veiy loud signals 
allowing for clipping. 

hx a second embodiment, the system courses an automatic vohame levder 
preceded or preferably foUowed by the highpass filter, providing a leveled signal, foUowed 
by a gam, and a cUpper, preferably followed by the low pass filter. This embodiment is 
preferred when low computational effort is preferred 
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6 19.01:2004 
(Hard) clipping is a single operation in which any signal above a threshold 
signal slreajgth is reduced to said given Ihrrahold signal strength, Le. a maximum signal 
strength is set. The advantage of such an embodiment is that a single device is used, the 
disadvantage is tiiat the signal is more heavay distorted, since any details in the signal above 

the threshold signal are lost 

In preferred embodiments the device conqnises a measuring device, sudi as a 
microphone, for measuring badkground noise levels. The device comprises an adaptor for 
acbpting one or more of the parameters in dependmce on the measured background noise 
level Such parameters are e.g. the cut-off freqoemgr and order of the high-pass filter, the cut- 
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:s the depmdency on the measured noise level is 



Preferably for one or more of tbe parai 
non-linear. 

Within liie concept of the invention a 'clipper',, 'compressor*, 'amplifier', 
'filter', 'converter', 'comparator' etc are to be broadly understood and to con^rise e.g. any 
piece of hard-ware (such as a cKpper, compressor, amplifier etc), any circuit or sub-circuit 
designed for perfiraning a clqiping, compression, aniplification etc fianction as described as 
weU as any piece of soft-ware (computer program or sub program or set of compute 
programs, or program code(s)) designed or programmed to perform a cUpping, compressing, 
filtering etc operation in accordance wilh the invention as well as any combination of pieces 
of hardware and software acting as such, alone or in combination, without being restricted to 
the below given exemplary embodiments. One program may combine sevral functions. 

The invention is also ernbodied in any coxnpxxtea: program comprising program 
code means for p^foiming a method in accordance with the invention when said program is 
run on a computer as well as in any computer program product conqmsing program code 
means stored on a computer readable medium for performing a method in accordance with 
the invention when said program is run on a computer, as well as any program product 
comtprisiag program code means for use in a telq)hone system in accordance with the 
invention, for performing the action specific for the invention. 



30 



Iliese and fiirther aspects of the invention will be explained in greater detail 
by way of exainple and with reference to tiie accompanying drawings, ui which 

Fig. 1 is a schematic diagram of a system including a loudspeaker, and a DSP. 
Fig. 2 shows schematically a DSP ia accordance with the invention 
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Fig. 3 shows two examples of high pass filters which may be used in the 



inventioii. 



Fig. 4 shows the wavefoim of a typical speech signal. 
Fig. 5 illustrates the waveform after having been filtered by the high pass filter 
S Fig. 6 shows the spectrum of the waveform shown in figures 4 and 5. 

Fig. 7 illustrates one type of embodiments of the invention. 
Fig. 8 depicts the functional block diagram of an exemplary AGC as 
schematically shown in Fig. 7. 

Fig. 9 illustrates a di£ferent type of embodiment 
10 Fig. 10 illustrates in more detail an exemplaiy AVL element as schematically 

indicated in Fig. 9 

Fig. 1 1 illustrates the AVL behavior of the AVL of Fig. 10. 
Fig. 12 shows an example of a compression fimctional relationship 
Fig. 13 illustrates a preferred embodiment in which parameters are adapted in 
IS dependence on a measured noise level. 

Fig. 14 illustrates first, second and third order dependence of a parameter P on 

noise level S. 



20 The present invention wiU. now be described more fully hereinafter with 

reference to the accompauying drawings, in which preferred embodiments of the present 
invention are shown. This invention may, however, be embodied in many different forms and 
should not be construed as limited to the embodiment set forth herein; rather, these 
embodiments are provided so that this disclosure will be thorough and conq)lete, and will 

25 fully convey the scope of the invention to those skilled in the art Like numbers refer to like 
elemmts throughout 

Fig. 1 illustrates schematically a sound reproduction system. Such system can 
for instance be a hands-fijee loudspeaker cellular radiotelephone for use in an automobile. 
When implemented as a hands-fi:ee cellular telephone, speech signals received fix)m a fer 

30 end, i.e. fi:om a distant party, are transmitted &om a cellular base station (not shown), 

received by the transceiver of the cellular phone (not shown), and appUod to the input 1 fer 
an incoming fer eod signal as an input waveform W. In this example it is assumed that Ifae 
transmission back and forth between the system, in Ihis example a telephone system, and the 
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far end is in a digital form. If the original signals are in an analog form the system comprises 
an A-to-D converter to gmetate a digital far end signal which is then &d into input 1. 

As shown in FIG. 1» the waveform is applied in a digital format at input 1 of or 
connected to a DSP (digital sound processor) 2, which is connected or whidi comprises a 
5 digital output 3. The digital signal output is fed to and converted to an analog format by D-to- 
A converter 4 and amplified by amplifier 5 for use by the loudspeaker 6. A sound pressure 
wave Wl representative of the speedi of the distant party is emitted by loudspeaker 5. 
Accordingly, the radiotelephone user hears sound pressure wavefimns which are 
representative of the speech of the distant party. 

10 However, the listener does notjust hear the sound generated by the loud 

speaker, but also other sounds, which may make the sound generated by tlie loud speaker 

difQcuIt to understand, i.e. of low intelligibility. 

Turning xtp the volume seems a first and obvious choice to increase t)ie 

intelligibility. However, the maTrimimi output level of the loud speaker is ofien limited and 
1 5 suxq>ly tuniing up tlie volume often leads to more noise, not necessarily a better intelligibility 

ofthesignaL 

To improve the mtelligibility a number of co-operating measures are taken in 
the device and method in accordance with the invention. 

Fig. 2 illustrates very schematically a DSP (digital sound processor) for use in 

20 a device in accordance with the inventioa The DSP conqxrises a Ugh pass fQ^ wilfaa 
cut-off frequency between 300 Hz and 2 kHz, preferably between SOO and 1500, more 
preferably betwera 800 and 1 200 Hz. The high pass filter removes or reduces fiiequency 
components below tbe cut-off fiiequency f . 

Most of the energy of the signal has thereby been iraioved. This enables Hie 

25 remaining signal to be much more aniplified (befi>re running into problems in regards to a 
digital clipping, i.e. a value hig^» than the maviTnimi value). The anoplification is done by 
the compressing amplifi^. Within the concept of the invention a compressing amplifier is an 
amplifier which amplifies the signal but also levels the average sound level, i.e. sounds 
having a small amplitude are more amplified than sound having a high sound level, thus 

30 reducing the signal amplitude range. This may be done in several manners e.g. a 

clipper/limiter arrangement, a clipper/compressor or an AVL (automated voltage leveler) 
followed by a gain and a clipper. A number of different techniques may be used, including 
using look-i^ tables to perform the amplification and compression. The anoqplitude range, in 
particular the upper limit of the range may be set by the manufacturer or influenced by the 
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user e.g. by means of a loudness setter (a knob with which the user may set the loudness). 
Various examples will be given below. Compared to a straigjhtforward, linear amplification 
of the signal (i.e. for all sound levels an equal amplification &ctor) the compressed 
amplification leads to more intelligibility of the sound The words are more easily 

5 distinguishable, and thus the intelligibility of the sound is improved. It could also lead, 
however, to a distortion of the sound since the nonrlinear amplification of the sound 
introduced overtones (high firequency components at double, triple etc the original frequency) 
which leads to an increased harshness of the sound. This is perceived by the listener as being 
unpleasant, and in fact, in an important sense, reduces flie intelligibility, in a broader sense, of 

10 the vocal message, since the harshness of the spoken words often forms an iasportsnt aspect 
of the vocal message. This eflfect is present even without clipping, the clipping itself also 
introduces overtones. Intelligibility in a broader sense does not just relate to the words as 
such, but also to the message the speaker wishes to convey to the listener. The harshness of 
the sound, especially at higher average amplification, makes everybody sound angry, thus 

15 strongly reducing the finesses in emotions the speaker wishes to convey. The application of a 
low pass filter, as in a preferred embodiment of the device, depicted in figure 2, after the 
compressing amplifier reduces the perceived harshness of the voice, restoring at least to some 
extent the original emotional content of the spoken words, i.e. giving a much more natural 
sound. It is remarked that in most western languages the pitch of a word rnfluences the 

20 emotional icapact of the word, but not the meaning of the word per se. HowevCT, there are 
languages in which the pitch of the word plays a much larger role, leading to completely 
different meanings for one and the same Vord" depending on flie pitch of the word. When 
such languages are used (which cannot be excluded) the use of the low pass filter becomes 
even more advantageous. The invention is in particular of advantage i^en used in 

25 conjunction with or m automated speech recognition systems, especially for languages in 
which the pitch of the spoken word influences the meaning of the words. What is above 
discussed in relation to spoken words, i.e. voice, equally appUes when the sound reproduction 
system is used to reproduces music. Also in music the way music is perceived is of course 
dependent on whether one can hear the notes, but also the harshness of the sound is very 

30 irnportant The invention is thus, although of great in5)ortance to systems in which vocal 
messages are relayed such as telq)hone systems, not restricted to such systems, systems for 
reproducing music may equally benefit of this invention. 

The leveling or cong)ression action may be performed before HP fiyitering of 
the incoming signal or after it has been HP filtered. The an:q)lification per se (i.e. the gain) is 
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done after HP filtering. The clipping is done after Ifae gain or in conjunction. When use is 
made of a low pass filter this is positioned after the clipper. 

Figure 3 gives two exanq)les of high pass filters usable in a system in 
accordance with the invention. 

The left band side of the figure illustrates a 1 ^ oidec filter, the right hand side a 
second order filter. The shown big^ pass filter have a cut-off frequency f of approximately 1 
kHz. First or second order high-pass filters (which have relatively moderate sloops of S-IS 
dB per octave) are preferred. Removing too much of the low fiBquendes component results 
in a very unnatural sounding voice (or unnatural odd sounding music) Therefore the order of 
the bigh pass filter is preferably limited at 2. This also reduces the computational power 
required. Preferably the user can change the bigh pass filter from 1 ^ to 2*^ order and vice 
versa, or the system comprises an automatic switching mechanism dependent on the 
incoming signaL Using tbe 2'^ order results in higji speech intelligihility (in restricted sense, 
i.e. only the words) and/or signal loudness, wbereas the 1 * order HP filter will better preserve 
the natural sound of the original signaL 

Tbe HP filter may fi)r instance consist of a biquad wbose fiequency magnitude 
response is shown in fig. 3 and wbose coefficients are listed in Table I, according to the 
format 

1 + fljZ +^2^ ^ 

Table 1 - Fater coeffidrats of flie HP fflters depicted In fig. 3. 





border 




bo 
bi 
bi 


0.70710678118655 

-0.70710678118655 

0.00000000000000 


0.56903559372885 

-1.13807118745770 

0.56903559372885 


fli 

02 


-0.41421356237310 
0.00000000000000 


•0.94280904158206 
0.33333333333333 



The eflFects of application of a higjh pass filter can be seen in figure 4, 5 and 6. 
Figure 4 shows the waveform of a typical speech signal, figure 5 the waveform after having 
been filtered by the bigh pass filter (processed by a first order IIR high pass filter, cutoff 
frequency at 1 kHz). 
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Figure 6 shows the spedium of llie waveform shown in figures 4 and 5, the 
signal energy of the waveform of fig. 4 being depicted in the upper curve, flie signal energy 
of the wavefonn after the high-pass filter being depicted in the lower curve. As can be seen 
form figure 6 the signal energy of the waveform of fig. 4 is mainly concentrated in the lower 
5 fieguendes (below 1 kHz). 

These fiiequencies contribute mainly to the specific sound of the voice, but less 
to die speech intelligibility. This property fonns one aspect of the invention. 

By attenuating the lower firequendes, the signal amplitude will decrease 
significantly, as can be seen in figure 6, creating headroom to amplify the remaining signal 
10 which contains relatively more firequencies contributing to the speech intelligibility, as in 
clear from figure 6. 

When compressing and amplifying the speech signal afterwards, even when 
clipping it, the speech intelligibility will be better than without the use of the HP filter for 
mainly two reasons: 

IS - the signal contains relatively more fi^equencies contributing to the speech 

intelligibility 

the low frequencies are less hard clipped, resulting in less harmonics (due to 
clipping) disturbing the speech intelligibility 

Removing too much of the low fi^equencies, however, would results in a very 
20 unnatural sounding voice. Therefore, the HP filter is preferably only first order e.g. a 
Butterworfh (first order IIR) filter. This had the advantage of little computational power 
consumption. 

Figure 7 illustrates a detail on a system in accordance with an embodiment of 

the invention. 

25 The DSP comprises a HP filter (for instance one as illustrated in figure 3, in 

this example for instance a Butterworfh 1 ^ or 2 order fiDlter with cut-off frequency value of 
for instance 1 kHz), followed by an AGC, followed by a limiter/clipper, followed by a low 
pass filter (LP). 

In this example all audio streams may be mono. The sample rate frequency 
30 can be e.g. one of the following: 8 kHz, 1 1.025 IdHz, 16 kHz, 22.05 kHz, 32 kHz, 44.1 kHz or 
48kHz. 

In this exaniple the AGC (Automatic Gain Control) acts block based, meaning 
that the gain &ctor only changes per block. In this way the computational power is kept to a 

TTii'niTmmi . 
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Figure 8 depicts the fimctioiial block diagram of the AGC. The gain is 
co]]q)uted such that the RMS of the AGC ou^ut block is approximately equal to the KMS of 
the AGC ii^ut block. Th^fore» the relation of the input KMS over the oulput RMS (tills is 
the target value) is coinpared to the actual gain, and this gain is adjusted towards tiie target 
S value. The gain variations must be smoothed to avoid too much signal distortion due to local 
waveform discontinuities at the block edges. 

The limiter/clipper increases the loudness level while keeping signal distortion 
within bounds, preferably as low as possible. 

In this example the limiter/clipper is a sample-based gam. Ja common 
10 conq)ression techniques the gain is dependent on the available signal 

The limitei/clipper may use a so-called crest-fector to calculate the allowed 
gain. The crest-factor is in &ct fbe relation of the peak value and the running SMS value of 
tie signals. In the limiter/clipper, signals with a high crest-&ctor, e.g. a drum beats, are 
clipped without severe audible distortion. Doing this, the gain can remain at a higher level, 
IS reducing the '^utnping effect' of Goinmon con^ressors. 

The gain is calculated as follows in tbis exemplary embodunent: ' 
First of all, the running RMS (Root Mean Square) value of the input signal is coirputed This 
RMS value is a smoothed average based on the recent **history'' of the signal waveform. Then 
the peak value is calculated using a look-ahead time in order to anticipate to upcoming signal 
20 peaks. 

Witii the RMS and the peak value, the orest-factor is computed A so-csdled 
^depeak" factor is used to specify how hard the algorithm can dip pealsy signals (high values 
will yield more clipping). Afierwards the gain is computed and is conq)ared with tiie 
m a xim a l allowed gain, which can be set by the user, and the minimftl value of the two is 
25 chosen. Although not shown here the maximum allowed gain settmg can be an icput for the 
high pass filter, wherein the cut-ofif frequency is a fimction (or more in general one or more 
characteristics of the filter, which could apart &om the value of the cut-ofif firequency also or 
alternatively e.g. include switching firom a first order to a second order) of the mgnriTnimi 
allowed gain setting. 

30 The low pass filter LP filters the high frequencies with a more natural sound as 

- • » 

a consequence. Although not shown here the maximum allowed gain setting can be an input 
for the high pass filter, wherein the cutrofp frequency is a fimction (or more in general one or 
more characteristics of the filtrar, which could apart fix)m the value of the cut-off frequency 
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also or alternatively 6.g. include switching from a fiist order to a second order) of tiie 
maximum allowed gain setdng. 

Fig. 7 and 8 illustrate one type of embodiments of the invention, fig. 9 and 10 
are directed to a different type of embodiments. 



5 Basically these embodiment comprise 4, preferably 5 elements or steps: 

1 . an AVL (Automatic Volume Leveler) : the AVL is a signal dependant 
processing block, keeping the volume of the incoming sigoal at an approximately constant 
level, . 

2. a first order HP (high pass) filter : this filter removes a part of the lower 
10 fi:equ6ncies, creating headroom for amplification. 

3. a gain : increasing the SPL (Sound Pressure Level) of the signal 

4. a cUpper, preferably a hard clipper when a simple device is preferred: the 



signal is clipped at a certain amplitude, to assure linear operation of the analogue amplifier 
(after D/A conversion). Instead of a hard clipper, which simply clips the signal above the 

IS clipping level, a soft clipper may also be used, which chps the signal above a cUpping level 
but also attenuates the signal at level close to the clipping level. Using a sofi-chpper restores 
to some extent the dynaimc behavior of the signal, increasing intelligibility. 
5, (preferred) a first order LP (low pass) filter : die filter restores or at least 

improves the balance b^ween mid-range fiiequencies and high fi:equ6nci6S is unnatural, and 

20 the signal sounds rather harsh; this filter makes the processed sound more pleasant to listen 
to. 

In this exaiiq)le the input is a speech iiq)ut, but it is reniarked that the m^ 
may be any sound signal 

Figure 10 illustrates in more detail the AVL element 
25 Figure 10 shows the block diagram of the AVL. Division of the maximum 

output ainplitude (called "Heference^ClippiiigJLin" in Fig. 1 0) by the absolute value of the 
maximum amplitude of the signal across the entire input block yields the gain that can be 
applied to the block without any clipping (called 'Instant gain"). The parameter 
''GainUpdate" is typically 1/1000*^, Thus, the signal after the sum is increased each blodc 
30 with an amount dq)ending on the ^'Instant gain". Doing so, the AVL ^in will be increased 
faster for higher values of 'Tnstant gain'*. 

Since the evratual gain ^plied to the iqput signal ("^AVL gain'^ should not be 
higher than ("Instant gain"), the minimum of both is takea ('"Afin"). The minirnnin block 
also assures that the AVL gain is decreased immediately when the incoming signal amplitude 
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iiuxeases. To prevent lliat silent parts in Ike inpvA signal are aiq>lified too much, the signal 
after the Tninimum block is limited (typically 12 dB). Note thal^ since the maximum ou^ut 
amplitude is not necessarily equal to the linear level representing 0 dB, the AVL gain can be 
smaller than 1. 

5 Last, if the amplitude of the input signal is smaller than a certain valn^ for 

example Ike signal noise floor, no gain will be applied. This is a secondary functionality, in 
preferred embodiments, which can also be used as a fimctionality in other types of 
compressing amplifiers to prevrat small input signals fiom being amplified, noise is then not 
anoplified, winch increases the intelligibilily of the sound. 

10 Numeric exanmle: 

ji 

sanmle rate is 8 kEIz 
block length is 80 sanciples 
word length is 1 6 bit 

mflvi'nmTn output aooplitude is 32768 (0 dB) 
15 - input signal is sinusoidal witb amplitude 32768 (0 dB), fiiequency >= 100 Hz 

(at least one sine period per blodc) 

cunent AVL gain is 1 (0 dB) 
GainUpdate^ 0.001 

AVL gain C'Old value") is limited at 12 dB 
20 Now, the input signal amplitude decreases by 40 dB (fector 1 00 on ai linear 

scale). The table below shows how the different signals inside the AVL change with time. 
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25 The AVL behavior is also shown in figure 11 
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The left graph shows the step change of the amplitude of three input signals. 
The right graph shows that the AVL gain increases fester for large changes in amplitude. This 
is a preferred mibodiment furllier improving intelligibility. 

Fig. 12 shows an example of a compression function. Upto a certain value the 
5 signal is left untreated (multiplied by one) and above that value the outputted signal 
amplitude is smaller than the inputted signal amplitude, i.e. the signal is compressed. A 
fimctional form whidi only applies compression in the highest regions as in Fig. 12 may be 
used for the clipper, whereas a fimctional form which provides a compression strategy over a 
larger region (e.g. the whole input signal range) may be used for the compressing amplifier. 
10 The invention may be used in various devices. The invention is in particular 

usefiil fer hands jfiee mobile phones. However, it is applicable for all sound reproduction 
systems, especially those which run on a device vwth a limited voltage supply and/or small 
loudspeaker. A list of possible applications: 

handsets (mobile phone, DECT, etc.); 
15 - portable devices, e.g. a portable DVD player 

PDA'S; 
car-kits 

TVs; computers; 

web-terminals; 
20 - answering machines; 

It will be appreciated by persons skilled m the art that the present mvention is 
not limited by what has been particularly shown and described hereinabove. The invention 
resides in each and every novel characteristic feature and each and every coihbination of 
characteristic features. RefCTence numerals in the claims do not limit their protective scope. 
25 Use of the verb "to comprise" and its conjugations does not exclude the presence of elements 
other than those stated m the claims. Use of the article "a" or "an" preceding an element does 
not exclude the presence of a plurality of such elements. 

The presCTt invention has been described in terms of specific embodiments, 
which are illustrative of the invention and not to be construed as luniting. The invention may 
30 be inqilemented m hardware, firmware or software, or in a combination of them. Other 
embodiments are within the scope of the following claims. 

For instance in the examples given so fer all or most of the data and settings 
for the various elements are fixed. However, witiiin preferred embodiments of the invention 
the settings of the various elements are made adaptive to a measurement of enviromnental 
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noise. Figure 13 illustrates such as example. Using a measuring device 130 for measuring 
background noise, such as e.g. a separate microphone, and an adaptor 13 1 the settings for one 
or more of the different elements (AVL (gain), HP-Fnter (cut-off ftequency). Gain (gain G), 
Hard CHpper (CUpping level), LP Filter (cut-off ftequency)) are adapted. 

5 The different adaptations are labeled 132 to 136. 

The noise measuremrat may give a single data e^q^iressing the ov0:all noise S 
or may give a noise figures for different noise band Sn, Scz, Sq etc. If noise figures are 
measured for different noise bands an average or total noise may be calculated for instance 
Sair=ZSfi, or weighted according to a dB(A) scale for instance Sa/=SwiSfi ^ere wi are 

10 weighting coefficients of the dB(A) scale. The noise level is measured by an amplitude 
measurement 

The different adaptations will be discussed by means of examples. 
132: Adaptations of the AVL settings: 

15 

In the AVL the AVL gain is limited (see figure 1 0, AVL gain limit), and the 
below a certain level no amplification is ^lied (see figure 10, box below which is stated no 
amplification, deternuning the noise floor witch). Depending on the measured noise S the 
noise floor switch is increased and the limit to the AVL gain is ino^^ 

20 

133: Adaptations to the HP filter 

The cut-off frequency is adapted to tiie measured noise. The higher the noise 
level the larger the cut-off frequmcy. For ad^tive embodiments the cut-off fi»quen<qr may 
25 advantageously range over a broader rang^ than for non-adaptive embodiments. Whereas for 
non-adaptive embodiments the cut-off fiiequency advantageously ranges between 300 Hz and 
2 kEIz for adaptive embodiments the cut-off frequency advantageously range betwem SO Hz 
(for situations m vMdx there is substantially no noise) to typically iqi to 2 kHz for high- 
noise-level situations. 

30 In adaptive processing mode, the filter cut-off frequency is thus updated 

according to the amount of mviromnental noise, and ranges typically from a very low value 
e.g. 50 Hz (no environmental noise) and 2 kHz (loud environmeDtal noise). More low 
fisquencies are removed m loud noisy environments to create more headroom to amplify the 
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signal afterwards. A maximum of 2 kHz is recommended to avoid removal of fiequencies 
contributing to the speech intelligibilily. The filter coefficients are calculated at run-time. 

The relation between cut-oflF frequency to noise level is preferably set as 
follows: fcut^£pfo+Af (S) where fo is the low noise limit (e.g. 50, 100 or 300 Hz) and Af is a 
5 higher than linear (proportional to S' where i is greater tiban 1) jHrnction of the noise level. 

In more complex embodiment not only the cut-off frequency may be 
dependent on the measured noise level but also the shape of the filter function, e.g. whether 
Ihe filter fimction is a first order or a second order filter fimction. 

10 134: Adaptations for the gain G. 

The gain G is increased as the noise level increases. However, the g^in 
preferably does not increase linearly with the noise level but rather as follows: 
G(S)=Go + cS^ where i is more than 1, e.g. 2 or 3, and c is a constant where c maybe zero 
1 5 below a threshold value of S. A simple relationship is for instance G(S)=Go + cS^ i.e. the 

gain does not dq)end Imearly on the noise, but stays at or close to the nominal 
increase rapidly at high noise levels. 

The increase in gain has a positive effect; but also a negative. By in^easing 
the gain non-linearly as a non-linear fimction of noise a better balance between the positive 
20 and negative effects may be obtained. 

The value of the gain is e.g. adapted to the amount of environmental noise (in 
adaptive processing mode) ranging fix)m OdB to +18 dB. 

135: Adaptations for the clipper: 

25 

The clipping action is increased as the noise level increases. This is preferably 
a non-linear function of Ihe noise. The maximum level above whidi the signal is clipped may 
be reduced by 12 dB for high noise levels. For instance without noise Hie digital clipping 
level may be 0 dB, whereas at higji noise levels clipping is set at -12 dB. 

30 

136: Adaptations for the £x)w pass filter. 

In the absence of noise the cut-off frequency is set at the hi^ limit e.g. Fg/2. 
For m a ximum noise the cut-off frequency is set at the lower limit, e.g. 2 kfife. 
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In adaptive processing mode, the filter cutoff fiequency is updated according 
to the amount of environmental noise, and ranges e.g. fixim 3.900 kHz (no environmental 
noise) to 2 kHz (loud environmental noise) for narrow-hand speedi applications (sample rate 
is 8 kHz). The fUiter coefficients are calculated at run-time. 
5 Preferably the functional relationship between the measured noise level and 

the culroff frequency is ^art:^ffLp=ftLP-AfLP where Afip is a non-linear function of tiie 
measured noise level, e.g. proportional to the quadrature of the noise level. 

hi these embodiments the algorithm used is designed to operate adaptively, 
driven by the amount of environment (near-end) noise. This results in a user-friendly device 
10 feature that aUows the device user to use its device (e.g. a GSM) in varying conditi^^ 
concerning environmental noise, without the need of any further interaction to control the 

GSM volume level. 

When used in adaptive processing mode, the parameters of the processing 
blocks are adapted iar incoming samples according to flie environmental noise. The 
IS algorithm adapts parameters according to the environmental noise. The amount of noise may 
be measured by a separate microphone or estimated using the device (GSM) microphone (for 

a single microphone application). 

The main focus of the invention is and rmiains speech intelligibility. Hus is 
given hi^ier priority than naturahiess and warmtii of the voice. When tiie environmental 
20 noise decreases in vohune (operating in adaptive processing mode), tiie parameters are 

preferably ad^ted very quickly such that tiie naturahiess and wannth of the incoming signal 
are restored. 

In the adaptive mode tiie amount of CTvironmental noise influences the 
different processing block parameters. The noise may range rangmg from *no environmental 
25 noise' to *loud enviroomental noise' . 

The temi *no enviroimiental noise' does not mean complete silence, but 
regular noises such as fen noise, background music, eto. In a typical environment, the 
background noise is typically around 50 dB(A). The term *loud envitonmmtal noise' refers 
to the noise of a passing train or siibway, noise inside a dance club, eto. These noises can 

30 measure up to 100 dB(A). 

The noise may be measured by measuring spectral amplitude information of 
the environmental noise and calculating a single value representing the amount of noise. One 
or more, preferably most, most preferably all of the parameters of the different elements 
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(AVL, HP-filter, gain, clipper, LP-filter) are adapted according to this single value, by 
interpolation between their minimum and maximum values* 

As explained above this interpolation is not necessarily linear. Tests using 
linear interpolation showed that for 'medium' environmental noises, the algorithm effect was 
S too large. 

Using a higher, for instance second or third order interpolation, the effect is 
smaller compared to linear interpolation for the same environmental noise. For loud 
environmental noises, the amount of effect is equal. 

Figure 14 illustrates the adaptive behavior of a parameter (for instance the gain 
10 G, a cut — off frequency, clipping level). The parameter P varies in this exanople between a 
lower limit value Ptow, in this example 2, to a high limit value Pu^h > in this example 7, as a 
function of the measured noise level, in this example the average noise anq)litud6 S. The 
different curves labeled 1, 2 and 3 give first (Linear), second (quadratic) and third order 
dependency of the parameter on the measured noise level The higher die order die more 
1 5 pronounced die adaptive effect is at high noise levels. 

In short die invention can be described as follows: 

A sound reproduction system comprises a digital audio signal input (1), a 
digital audio signal processor (2, DSP) and a digital audio signal ou^ut (1) wherein the 
digital signal processor (2, DSP) comprises a high pass filter (21) with a pass frequency (f) of 
20 between 300 Hz and 2 kHz, a coiEigpression amplifier (22) for compression and amplification 
of a signal to within a signal aniplitude range and a clipper for clipping the signal above a 
clipping level, and preferably a low pass filter (23) for filtering the signal provided by the 
compressing amplifier and for providing an ou^ut signal, die pass firequenc^ of the low pass 
filter (f ) lying in die range 2 kHz-Fs/2 where Fs is the sampling frequency. 
25 Fs/2 may be 6.g. 4 kHz. 

Many furdier variations are possible widiin the concept of die invention. 
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CLAIMS: 



10 



1. A sound reproduction system comprising a digital audio signal input (1), a 
digital audio signal processor (2, DSP) and a digital audio signal output (l).wherein the 
digital signal processor (2, DSP) comprises a high pass (HP) filter (21) with a pass frequency 
(f) of between a first and a second frequency, a con^pressing amplifier (22) &r compression 
and anq)lification of a signal, at least amplification being performed after HP filtering, and a 
clipper for clipping the HP filtered, compressed and amplified signal above a clipping level 

2. A sound reproduction system as claimed in claim 1 , wherein the pass 
fbequency (f) is a frequency between 300 Hz and 2 kEIz. 

3. A sound reproduction system as claimed in claim 1 , wherein Ihe high pass 
filter is a first order or second order filter. 



4. A sound reproduction system as claimed in claim 1 wherein fhe device 

15 comprises a measuring device (130) for measuring background noise and an adaptor 131 for 
adapting one or more parameters (f, order) for the high pass filter (HP). 

5. A sound reproduction system as claimed in claim 4, wherein the pass 
fi:equency is adaptable between 50 and 2 kHz. 

20 

6. A sound reproduction system as clauned in claim 1, wherein the compressing 
amplifier is arranged not to amplify a signal having a signal strength below a threshold value. 

7. A sound reproduction system as claimed in claim 1 , wherein the device 

25 comprises a measuring device 130 for measuring background noise and an adaptor 13 1 for 

* 

adapting one or more parameters for the compressing amplifier (22). 

8. A sound rqiroduction system as claimed in claim 1 , wherein the digital audio 
processor comprises a low pass filter (23) for filtering the signal provided by the 
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compressing amplifier and for providing an output signal, the pass frequency of the low pass 
filter (f ) lying in the range 2 ldIz-Fs/2 wheie Fs is the sampling fiiequency. 

9. A sound rqnoduction system as claimed in claim 8, wherein the device 

5 comprises a measuring device 130 for measuring background noise and an adaptor 131 for 
adapting one or more parameters (f) for the low pass filter. 

10. A sound rqiroduction system as claimed in claim 9, wherein the system 
coniprises a means for activation and/or setting of Ifae firequency dependence of the low pass 

10 filter in dependence on the aniplification in the compressing amplifier. 

11. A sound reproduction system as claimed in any of the claims 4, 7 or 9, 
wherem the one or more of the said parameters is a non-linear function of the measured noise 
leveL 

15 

12. . A soundreproduction system as claimed in claim 1, wherein the system 
conq)rises the high pass filter jfollowed by an AGC followed by a limiter/clipper. 

13. A sound rqiroduction system as claimed in claim 1, wherein the system 
20 comprises an automatic volume leveler preceded, or preferably, followed by the high pass 

filter, providing a leveled signal, followed by a gain and a clipper. 

14. A method far processing digital sound signals in which method fi:equency 
component of die sound signal lower than a cut-off frequency (f) between a first and a 

25 second fiiequency are attenuated, the sound signals are amplified and compressed to within a 
signal band width and clipped above a clipping level within the signal band width. 

15. A noiethod as claimed in claim 13, "Vfdierein the cut-off fi^quency is between 
300 Hz and 2 kHz. 

30 

16. A method as claimed in claim 13 wherein a noise level is measured and the 
cut-off fi'equency (f) is determined in dependence on the measured noise level (S). 
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17. A meliiod as claimed in claim 16, wherein the cut-off frequencgr is detemiined 
by a non-linear function of the noise level (S). 

18. A method as claimed in claim 16, wherein the cu^off fiequency ranges 
5 between 50 Hz and 2 kHz. 

19. A method as claimed in claim 13, wh^ein after compression and clipping 
frequency components of the resulting digital signal below a cut off fieqaency f between 2 
and 4 IdEIz are attenuated. 

10 

20. A method as claimed in claim 19 wherein a noise lefvel is measured and flie 
cut-off fieqaency (f) is determined in dependence on the measured noise levpl (S). 

21 . A method as claimed in claim 20, wherein the cut-off fiequency (f ) is 
15 determined by a non-linear fanction of the noise level (S). 

22. A method as claimed in claim 13, wherein activation and/or setting of the 
fieqaency dependence of the low pass filter is performed in dependence on the amplification 
in the leveling ampUfication step. 

'20 

* 

23 . Computer program comprising program code means for performing a method 
as claimed in any one of claims 13 to 22 when said program is run on a computer. 

24. Computer program product comprising program code means stored on a 
25 computer readable medium fi)r performing a method as claimed in any one of claims 13 to 

22. 



PHNL040024EPP 

23 19.01.2004 

ABSTEIACT: 



A sound leprochiction system conprises a digital audio signal input (1), a 
digital audio signal pioc^sor (2, DSP) and a digital audio signal oulput (1) wherein tiie 
digital signal processor (2, DSP) comprises a high pass filter (21) with a pass frequency (f) of 
hetween first and a second fi:equencyy e.g. between 300 Hz and 2 kHz, a compression 
5 amplifier (22) for compression and an:q)lification of a signal to within a signal amplitude 
rangp and a clipper for clipping the signal above a clipping level, and preferably a low pass 
filter (23) for filtering the signal provided by Ihe conxpressing anq)lifier and for providing an 
output signal, the pass fi:equency of the low pass filter (f ) lying in the range 2 kEIz-Fs^ 
where Fs is the sanipling fiequenQr. 
10 Fs/2 may be 6.g. 4 kHz. 

The parameters of the various elements such as cutoff frequency, order of the pass filter, 
gain, eto are preferably dependent on a measured noise leveL 

Fig. 2 
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